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analytic papers. Probably the greatest adv^antage is that it provides 
a leans by which the nature of the problem "^Y be handily suaaarized* 
The a-ethod o'f path analysis, although coiic^ived over sixty years ago 
by Sevell Wright, has only recently been introduced into educational 
literature. Its application in substantive analyses in edu<?^tio.n has 
also been infrequent, path analysis requires the researcher to ^think 
about systems of intercausal connections and requires a degree of 
explicitness desirable in scientific irriti^^. Path models may be 
classified into four categories: recursive* block, blopk-recursive, 
and nonrecursive. Each of these require^ a somewhat different 
strategy of analysis. The strategy for each kind of model is 
illustrated with substantive examples* The Bore difficult task of 
path analysis involves the construction of models that are consistent 
with sound theory. (Author/CTM) . ' ^ 
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♦ STRATEGIES OF. PATH 'ANALYSIS . ^ \ 

The method of path analysis, although conceived over sixty years . 
ago by Sewell Wrightr (1921, 1925), has only recently been iatroduced 
into the educational literature (Anderson and Evans, 1974;, Wolfle, 1977). 
Its application in substantive analyses in education has also been infre- 
quent, although sociolQgists have been using the method for over. a decade 
to analyze educational > attainment and its consequences (Blau and Duncan, 
1967; Hauser, 1971; Duncan, Featherman and Duncan, 1972; Sew»M and 
Hauser, 1975; Sewell, Hauser and Featherman, 1976). Several papers ^ 
have also summarized its methodological aspects; recent reviews include 
Goldberger (1972), Goldberger and Duncan (1973); Duncan (1975), Heise - 
(1975), and Bielby and Hauser (1977). These reviews obviate the need 
for an extensive review of the principles of path analysis.^ This 
paper assumes the reader's general familiarity with multiple regression 
analysis, and its similarity to. path analysis. Its purpose is to 
iTluminate the advantages of path analysis for the exposition of results 
in data analytic papers. 

A data analytic paper typically begins with a discussion that leads 
to an unresolved problem. Probably the greatest advantage of path 
analysis is that it provides a means by which the nature of the problem 
may be handily summarized. It requires the researcher to think about 
cause, particularly systems of intercausal connections, and provides an 
explicit link between a priori theoretical notions of causal connections 
and estimates of causal impact; The formulation of the model requires 
a degree of explicitness desirable in scientific writing, while requiring 
the researcher to defend the proposed model. 



Once thfe mgdel is formulated and defended, it provides a systematic 
guide for the epcposltlon of results. The analysis proceeds In accordance ^ 
with 'the logic of the model.. First, ^quantitative estimates ar4 attached 
to causal effects thoQght to^exist in the proposed model. But what the 
researcher can do next in the analytic process depends on the kind of 
model originally 'formulated. Path models^may be classified into four 
categories: recursive, block, block-recursive, and nonrecursive. Each of 
these requires a somewhat different strategy of analysis. The remainder 
of this paper presents the strategy of analysis for each kind of model, 
with substantive Examples. 
RECURSIVE EQUATION MODELS 

Recursive equation models have no feedback loops, either directly 

or indirectly; that is, the causal flow in the model i s unidirectional . 

The quantitative estimates atte^ched to the causal links may t)e either 

standardized or metric regression coefficients. In many cases the lat^r 

are preferred (see Duncan, 197?:51-66; Kim and Mueller, 1976), but for 

the instructive purposes of this papetr, standardized coefficients will ^ 

be used. Zero-order associations in models using standardized coefficients 

are measured by correlation coefficients. These may be decomposed into 

several components using the fundamental theorem of path analysis, which 

may be written (Duncan, 1966:5): 

r..=Ep.r. (1), 
13 q ^iq jq ^ 

where i and j^denote two variables in- the model, and the index q runs over 

all variables from which direct paths lead to X.. This decomposition 

provides the basis for interpreting the results^, and the kind of model 

dictates the interpretations that can be attached to the various components 

of the decomposition. 



T|^e strategy of analysis for recursive models is basically twofold. 
First, the researcher will want to obtain estimates of the extent to ^ 
wh'Ich intervening varfables account for relationships among variables. 
Thfese may be interpreted as indirect causal effects. Second, the 
res-earcher will want to dbtain estimates of the extent to which antecedent 
variables account for relationships between other variables. These may 
be interpreted as spurious effects, 

j/ie best way to illustrate'a strategy of analysis is to analyze a 
substantive example. Consider the following set, of structyral- equations : , 

\ ~- Pl2 ^2 Pl3 ^3 ^ Pl5 ^5 f Pl6 ^6 ^ P]?/? Plu " 

X2 = X3 + P24-x^ + P25 X5 V ' | 

""3 = P34 "4 * P35 '5 ^ P36-"6 ^ P37 ^7 * P38 ^8 * P3W^ 
Under the usual assumptions, the variables on the left-hand side of 
the equality are dependent upon 'tjjo^e to the right. The P^-j's are 
standardized regression coef ^jttients , known as path coefficients, in which 
the subscript i denotes the dependent variable. The residuals have expected 
values of zero, and expected covariations of zero with the other (Explicitly 
measured independent variables in the equation, and with each other. The 
three equations may be diagramed, as shown in Figure 1, 

In Figure 1 the straight arrows represent my hypothesis of causal 
effects; the arrowheads point toward the influenced variables. The arrow 
from to^X^j for example, represents the verbal statement,, "childhood 
intelligence is a c^use of educational attainment," 'or *'a change in 
childhood intelligence produces a change^ in educational attainment," 
A double-headed arrow represents a correlation between two variables. 
No causal interpretation is attached'to a correlation, I^n this case, the 
five variables ofi the left-hand side of the diagram are called exogenous 
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Figure 1. Recursive Path Model of Vocabulary Recognition, U.S. White 
Civilian Population,, A^e 25 to 72 • ^ 



variables, because the causes of these variables, whatever, they 

are, come from outside the model • These variables, it is recoqrrizetl, 

may be correlated* but for unknown reasoi>s. However, two correlations 

3re specified to be zero a priori, namely those between childhood in 

and seJ(, and childhood IQ and current ^^e, ^ 

A fully recursive model Is one in which every antecedent variable 
is allowed to affect every other 'variable subsequent to it/fn the model. 
The model presented in Figure T is not ful ly^r^cOrs,^' ve, because some of the 
possible causal connections have been elim/nated from the model. Some of 

0 

these were eliminated for a priori theoretical reasons. For example, 
the path from childhood ID to vocabulary recognition was assumed to be ^ 
nonexistent.^^^^f IQ affects vocabulary it must'do so at the time of 
the test; a person's past tO hars no direct bearing^ The paths from father* 
educational attainment and father's socioeconomic index to adult intelli- 
gence were similarly eliminated; the effects were assumed to exist in 
childhood, but not directly for adult IQ. Other paths were eliminated 
because the coefficients estimated from sample data failed tests of 
statistical significance. I therefore inferred that the di.rect causal 
path did not exist if^the population, removed' the path' from the model , 
and recomputed the coefficients of the remaining [^ths. The missing path 
from^sex to adult 10, for example*,, was eliminated because its coefficient 
was insignificant. 

The numbers along each path are standardized regressitfn coefficients 
obtained by regressing each criterion variable on the variables thought 
.to be causey of^.ij. The data were obtai(ied by combining pieces of evi- \ 
dence from diverse sources. Pending the completion of comprehensive 
and long-term longitudinal studies of truly representative samples, 
piecing together/Correlations from a variety of sources was the only way 



to obtain estimates for this model. Most of the correlations were 

■ , \V 

t)bta1ned from the Nations^! Opinion Research Center's 1976. general social 

survey. The correlations of variables with the two IQ measures were 
2 ' 

pieced together. 

Remember that the- strategy of analysis for recj^ive modfels is, 
first of All, to obtain estimates of the extent to which intervening 
variables account for relationships among variables. The relationship 
between age (Xg) and vocabulary (X^) will be used for illustration. 
Applying the fundamental theorem of path analysis, it can be seen that: 

•^15 = Pl2 '25 \^]3'3S ' Pl5 'sS ' Pl6 ^6 ' Pl7 ' SJ. - ' 
The correlation of a variable with itself is^ unity (^55,"^ 1.0], and the 
correlations r^g and r^^ are given as exogenous. The correlations/ r^g 
and ^35^nTay, however, in their turn be decomposed. Thus, . r 

•^25 ='P23 "^35 ' P24 '-^S ' P25 '5S ' . , " . ' 

■• "^35 = P34 "^45 \ P35 ^5 ' P36 P37 '57 ' P38 '58<: ' 

Equations 4 and 5 may be simplified by remembering that r^^ = 1.0 by 

^ 3 

definition, an^d r^g = 0 by theoretical assumption. Thus, 



25 " ^'23 ''35 ■' ^25 

"^35 = P35 ^ P36 "^56 ' P37/57 ' P38 "^58 ' 
The right-hand portion of equation 7 may be substituted into equation 
6 to yield: 

•^25 P23'^P35 ' P36 "^56/ P37 ^7^^38 '^58) ' P25 ' . 
The right-hand portions of ,jations 7 and 8 may- now be substituted 
into equation 3 to yield: 

•^15 = Pl2 fP23 >35 ' P36 "^56 ' ^31 '57 ' P38 ^8^ ' ^25^ 

' Pi 3 ^P35 ' P36 ^6 ' P37 "^57 ' P38 "^58^ 



' Pl5 ' Pl6 ^56." Pl7 ^57. • ' . ' ' 

Notice that equation 9 expresses the correlation between X, and 
solely in terms of path coefficients and correlations amon^ exogenous 
variable5. , Rearranging equation 9 will yield \he so-called reduced / 
form equation: . 

. • ''^S' Pl^ ' Pi 2 P25 ' Pi 3 P35 ' Pi 2 P23 P35. 

, ' Pi 6 "^56 Pl7 >7 Pi 3 (P36 "^56 ' P37 ^7 ' ^38 ^8^ 

" ' P12 P23 (P36 "^56 ' P37 '^57 ' P38 "53) ^^0)- 

\ 

1 

This equation provides the means by wh^ich the first strateq/ of analysis 

may be. realized, for each element oT the equation has^ substantive meaning 

The first element in equation 10, Pirj is the direct effect of age on 

lb 

vocabulary recognitiori. The coefficient is the number of standarcj ' 
deviations X-| changes when Xg increases one standard deviation, ceteris 
paribus. ^ 

The second, third and fourth elements of equation 10 are indirect 
causal effects. For example, P25 effect of age on vocabulary 

feco^nit^n .througti adult intelligence. In other words, age not only 
has a direct effect on vocabulary recognition, it also has an indirect 
effect because it affects adult intelligence which is itself a cause 
of vocabulary recognUlon, The product, p^^ ^35* f^^^stires the 
indirect .effect through educational attainment, and p^^ P23 P35 "measures 
the indirect effect through both education and adult intelligence. 

All of the remaining components in equation 10 contain in. them a 
correlation between exogenous variables. Previously I said that no, 
causal interpretation is attached to a correlation; therefore no causal 
interpretation is attached to a component of ^hp reduced-^form equation 
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which contains a correlation. These' components exist sirrtply bejcaus^ 
age' is correlated for unknown reasons with other causes of vocabulary 

■V, 

recognition. To, provide a collective label for these components, I will 
call them "joint associations. " . / 

The results of the decomposition may be presented in tabula'^ form. 
Table 1 represents one possibility. The total association between age 
and vocabulary is simply the zero-order correlation coefficient. T.he 
tota'l effect is the sum of the difect effect and all Indirect effec.ts; 
that is, the total effect of age on vocabulary is interpreted as the 
sum of all effects whether they occur directly or tfiriough intervening ^ 
variables. The other components; except for snurious effects wh|ch do . 
not*ap^^y In this case, have been discussed above. ^ 

*A substantive interpretation of these results is interesting, but 
somewhat problematic. In many situations the components are either all 
positive or all negative, and the components may be interpreted a 
percentage of the total association. However, in this case age S'eems - 
to operate as a suppressor variable, and all of the components 
larger in value than the total. I would interpret these results to 

\ * 

indicate that age has a strong direct effect on vocabulary--th^ older a 
person is, the longer his or her exposure to the language, and thus 
the greater is his or her vocabulary. At the same time, inteni9^'^c^ 
decreases with age (Wechsler, 19IS8), but 'intelligence has a positive 
effect on vocabulary recognition. As a result, age has a neqative 
indirect effect through intelligence. Aqe also has a neqative influence 

' • ■ i 

^ on educational attainment; this is probably a cohort effect wh^^^^^ the 
other relationships are effects of aging. In sum, the zero-border 
relationship of age *and vocabulary is negl iqible,^ut is cau^^ed by 



table 1. Decomposition of Association for Age and Vocabulary Recognition 



Variables 



Age and 

Vocabulary 



Type of Effect 



Decomposition 




) 

Total A^ociation 
. Total 

Direct Effect 
Indirect Effects 

1 ) through 

2) . through 

^^3) through X2 and 
.jointly 

Spurloi\s Effects 
f Joint Association 



-,01 
.06. 

^39 

r 

-.01 

' -.03 

not apDli cable 
-.07 



strong counterbalancing multivariate effects. * *' 

the second strategy of analysis for recursive models Is to obtain 
estlmatbs of the extenft- to which antecedenrt variables account for 
relatlon^h^jps^^^ other variables. These will tfe interpreted as 
spurious effects; The relationship between education (X^) and • 
vocabulary (X^) will be used for Illustration, Once again, the funda- 
mental theoriam is used to decompose the correlation .between the two 
•variables of interest. By substitution, the reduced-form equation 
expresses the decomposition in terms of path coefficients and corraTatioris 
betv^een exogenous variables. The Jntermediate steps are ]epr-for Inter- 
ested readert. The reduced- form equatioV is: 

'U ~- Pl3 ^ Pl2 P23.^ Pl5 P35 ^ P16 Pse ^ Pl7 P37 P12 P24 P34 - 

■ P12 P25 P35/ P12 P24 <P37^47 P38 "^48) 



* P12-P25 ^P36 "^56 ^ P37 "^57 ' P38 ^8^ Z 



^ Pl5 <P36/'S6 * P37 ^7 -^ Psa ^8^ 



* PI6 .^P35 "^56 ^ P37 •^67 -' P38 r'68) . 

^ Pl7'^P34 V P35 ^7 P36 "^67 P38 "^78^ 
The first component, P^3» of equation 11 is .the direct effect .of 
education on vocabulary recognition. The product, p^ 2 ^23* 
indirect effect of education on vocabulary through adult 10. The 
pjfijDduct, Pyg p^g* is the portion of the cor;^elation between education • 
and vocabulary that is due to the fact that both of these, variables. are 
directly caused by the antecedent variable, age. In common terms , pa rjt^ 
of the association between education and vocabulary is said to be 
spurioifsly caused by age. Actually, the product, p^^ P25 P35* 



a spurious measure of as it directly affects education and indirectly 
affects vocabulary through adult Itl. The product, p^g p^^^ is the 
spurious effect of sex on education and vocabulary, and p^^ p^^ is 
the spurious effect due, to father's education; 4)^^ P24 P34 the portion 
of the association between education and vocabulary spuriously due .to 

. childhood 'IQ as it directly aiffects education and indirectly affects 
vocabulary through aduVt IQ- All of .the remaining components of equation 
11 are joint associations, 

Table 2 presents the decomposition of the association between 

educational attainment and vocabulary recognition^^h direct effect 
of education on vocabulary (p^^ " -0^) surprisingly small given the 
amount of attention this effect has received (Bowen, 1977; Hyman, Wright 
and Reed, 1975; Harnqvist, 1977). Hyman, Wright and Reed's (1975) 
examination of the enduring effects of education concluded that education 
had strong effects on knowledge net of social origins. Their analysis 
showed that the partial correlation of education and a vocabulary item 
similar to the one used herein, controlling for social origins, was about 
80 perceflt of t^e zero-order coefficient (1975: 157). The data being 
analyzed here indicate that the partial correlation of education and 
vocabulary controlling for childhood IQ, age, sex, and father's education 
and socioeconomic index was .22, This value is only 43 percent of the 
zero-order coefficient. The difference between Hyman, Wright and Reed's 
results and mir)e probably lies in the ability of the present data to' 
cont^Qj for early IQ. At any rate, although the direct effect of edXica- 
tion is small, the total effect is considerably larger by virtue of 
education's effect on adult intel 1 l^^erice which in turn strongly affects 
vocabulary, recognition. The spurious effects of antecedent variables are 
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Table Decomposition of Association for Educational Attainment and 
Vocabulary Recognition 



Variables 



Type of Effect 



Decomposition Percentage 



Education 

and Vocabulary 



Total Association 
TQtal Effect 
Direct Effect 
Indirect Effect 

'^Spurious Effects 

1) by'X^ 

2) by 

3) by Xg 

4) by X/ 
Joint Association 



51 



.18 
■ .03 
.15 
.24 

.23 
-.02 
-.01 
.04 
.09 



inn.0% 

35.3 



47.1 



17^6 



/ 

/ 




particularly noteworthy. The present analysis points to the deficiency 
of previous studies in this area because they were not able to control 
for childhood IQ--a measure of the propensity for learning. Forty-five 
percent of the association between education and vocabulary may be said 
tO'be spuriously caused by childhood intelligence. Finally, a portion 
of the zero-order association may not be decoLmposed into components 
worthy of a; causal interpretation. About 18 percent of the association 
results from the correlation of exogenous variables with themselves for . 
unknown reasons. 

In explicating this model , I have for purposes of illustration 
decomposed only two associations of the niany that could have been con- 
sidered. Which associatj^ons should the researcher decompose by means of 
the reduced-form equations? Alwin and Mauser (1975) show how the 
reduced- form coefficients may be obtained by manipulating the equations 
in a computer. However, this begs th^ question of which associations 
are important enough to discuss, particularly when competition Yor journal 
space is considered. The answer lies in which associations are theoreti- 

s 

cally important. Decompose associations that illuminate the unresolved 
problem which initiated the exercise^ 

BLOCK EQUATION MODELS 

A bl6ck equation model is one in which each of a set of dependent 

variables is regressed on the same set of independent variables. For 
example, 

= Pi4 ^4 ' Pl5 ^5 ' Pl6 '6 + Pl7 ' hu " 

H ~- P24 '4 + P25 '5 ^ P26 ^6 + P?/.'? """27^ 

^3 P34 ^4 ' P35 ^5 ' P36 ^6 + P37 '7 "' Paw « (12). 
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These equations may be diagrammed, and are shown in Figure 2. The 
data were, taken from Sewell and Hauser (1975: 93), and pertain to ^ 
' Wisconsin male high school seniors of 1957 of nonfarm backgrounds. The 
three endogenous variables are the respondent's best friend's plans^ to 

» 

attend college (a duimiy variable: = 1 if yes, = 0 if no), parental 
^encouragement<:y'to attend college, and teacher's encouragement to attend 
college (also dummy variables). The four exogenous variables are 
fatherlip and mother's educational attainment, respondent's Henmon-Nelson 
IQ scor? recorded during their senior year, and the respondent's high 
school grades, recorded as their rank in high school class. 

The analysis goals for this model are (1) to compare the partial' 
path coefficients with their corresponding zero-order coefficients in 
order to determine how much of. the latter may be considered a direct 
, effect, and how much a joint association (note that this model does not 
^yield components interpretable as indirect or spup'ous effects); and (2) 
^ to examine the residuals for correlated errors. That is, the researcher 
employing a block equation model will want to determine how good a job 
the exogenous variables do of accounting for the correlations among the 
endogenous variables. To do so, the correlations of residuals are 
compared with the ,2ero-order correlations of the corresponding endogenous 
variables. 

To effect the analysis of residuals, the correlations between endogenous 
variables are decomposed by employing the reduced-form equations while 
allowing for the correlatiqn'Of their residuals. For example, using*the 
fundamental theorem of path analysis, the correlation between and 
may be decomposed: 

'^12 = Pl4''24^Pl5^25-^Pl6^26^Pl7^27-'Plu% • 
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Figure t. Block Equation Model of Encouragement to Attend College, 

Wisconsin Male High School Seniors, 1957 (Source: Sewe>+-dnd 
Hauser, 1975: 93) , ^ / 
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Stmllarly, the correlation of Y2 with u may be written: 

Substituting equation 14 into equation 13 yields, the reduced-form 
equation, 

'"IZ- = Pl4 "^24 ^ Pl5 '25 ^ Pl6 ^26 ^ Pl7 ^27 ^ Plu P2v '^u ^^^^ ' 

and < 

%u = (^12 - "24 ^ Pl5^"25 ^ Pl6 ^26^ Pl7 "27^^/Plu P2v ^l^^' 

In similar fashion, the correlations of r and r^ could be obtained, 

uw ' vw 

But notice that r^^ = r^^ 4567'' ^^^^ ^'^^ correlation of residuals 
V and u equals the fourth-order partial^ correlation of Y.j and Y^, con- 
trolling for X^, Xg, Xg, a^ Xy. With modern computers, the values for 
the correlated residuals may\)e obtained directly from a partial correlation 
routine. ' ^ 

Table 3 presents the decomposition ^ of the associations between the 
exogenous and endogenous variables. Because there are no intervening y 
variables, there can be no indirect effects. Because there are no 
antecedent vari allies, there can be no spurious effects. The direct effects 
are measured in the usual way; that is, they are standardized regression 
coefficients. Joint associations could be calculated by using the reduced- 
form equations, but in this special case are more easily obtained by 
subtracting the path coefficient from its associated correlation coefficient. 

Examining the effects upon parental encouragement to attend college, . 
-notice that the respondent's characteristics are the more important cause 
of parental encouragement, but parental characteristics are by no means 
unimportant. Moreover, for each of these variables about one-half of the 
association is a direct effect, while the remainder is due to the inter- 
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iT^ble 3. Decomposition of Associations for. Variables' in Block-Equation Model 

J ■ ' 





Total ' 


Direct 


Jo i n t 


Variables 


Association 


Effect 


Association 


PAED & F 


, .24 


(100%) 


13 (54%) 


.11 (46%) 


MAFD A F 


■ 21 


(100%)' 


no (435^). 


" 12 (57%) 


10 & F 


.29 


( 1 00%) 


12^41^) 


.17 (595^) 


ur a Ucb a r 


.31 


(100%) 


91 f fifty ^ 
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PAED & P 


.25 


(100%) ' 


.12 (48%)"* 


.13 (52%) 


MAED & P 


.23 


(100%) 


.10 (43%) 


.13 (57%) 


IQ & P 


.34 


(100%) 


.20 (59%) 


.14 (41%) 


Grades & P 


.32 


(100%) 


.17 (53%) 


.15 (47%) 


PAED. & T 


.15 


(100%) 


.04 (27%) 


.11 (73%) 


MAED & T 


.14 


(100%') 


.04 (29%) 


.10 {7U) 


IQ & T 


.35 


(100%)* 


.15. (43%) 


.20 {47%) 


Grades & T 


.42 


(100%) 


.32 (76%) 


.10 (24%) 
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^correlation (for unknown reasons.) of. the exogenous ^variables. My overall 
''impression is that parents who ^encourage their twelfth-grade children to 
attend college are nearly equally influenced by a variety of effects. On 
the other hand, teachers who encourage students to attend college are 
strongly influenced by the student's performance regclrdless K)f social 
background. To the'^e^tent that-social backgrouncj is associated with 

V.eacher's encouragement, it occurs primarily through correlations with 

^ther effects. For example, the total association of father's education 

iand teacher's encouragement may be decomposed: 

^34 = P34 ' P35 ^45 ' P36 P«7 ' \ 

= .04 + (.04) (.52) + (.15) (.25) + (.32) (.15) 
= .04 + .02 + .04 + .05 
\ = .15 \ ^ ^ (17). 

In plain English, most of the association of father's education and teacher' 
eticouragement is due to father's education being correlated with student's 
1(1 and school performance, which are themselves effects of teacher's 
encouragement. 

The second strategy of analysis for block equation models is to examine 
the residuals for correlated errors. We should on substantive .grounds 
expect students who are being encouraged to attend college by their parents 
also to be encouraged by teachers, and to have best friends whose plans 
include college. The correlation coefficients lend support to these 
expectations: i 
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Yet, once again cw substantive grounds, one wondeV'S whether these 
correlations result merely from the f^ct that they' have common antecedent 
causes. That is, the correlation of parental and teacher'si^encouragement 
could be- due to similar patterns by which parents and teachers assess 
the student's social background, ability, and performance. To see if 
this is so, we examine the correlation of the residuals, equivalent to 
the fourth-order partial correlation coefficient. If the residuals are 
uncorrelated, one would .conclude that the exogenous variables jointly 
account for the zero-order association. If the residual correlation 
coefficients are not zero, one would conclude that the model has not 
revealed all sources of covariation between the endogenous variables. 
'Indeed, that is what the correlation of errors would lead us to believe, 
for: 

^12.4567 = , ■ . 

r = 22 

r =33 ' (IQ) 

23.4567 ^ ^* 

Consider the zero-order correlation between parental encouragement • 

and teacher's encouragement (r^j = .44). Once again, decomposition 

permits causal interpretations to be attached to the components of the 

reduced-form equation: 

•^23 = P24 "^34 ' P25 "^35 ' ^26 ^36 ' ^27 '37 ' ^2, 'v^ ^^^^ ' 

and = .44 = .1 7 + .27 ' (21 ). 

where the first portion (.17) is the sum of products of direct effects 



and correlations, and the second portion (.27) is due to the effects 

of correlated errors. In other words, of the total association of teacher's 

and parental encouragement, 39 percent can be attributed to correlated 



Antecedent variables and 61 percent to correlated errors. In conclusion, 
it is cleat" that the correlation between teacher's and parental encouragements 
do not'result, solely from common antecedent causes; most^of tfie association 
is due to the cbrreTation of causes that have not been explicitly measured 
in the model . * , \ 

BLOCK-RECURSIVE MODELS \ ' . ' 

A block-r^cursi V€ equation ;model is, as the name implies, a model 
that combines both recursive and^bldck equations! The analysis goals are, 
therefor^ defined by what onevcan do with block and recursive equation 
models.^ First, the researcher will want to obtain estimates of the extent 
to which intervening variables account for relationships among variables. 
Second, one will wan|: to obtain estimates of the extent to which antecedent 
variables account for relationships between 'other variables. Third, the 
researcher wi 1 1 want |b examine where "appropriate, residuals for correlated 
errors. Finally, the researcher wijl want to determine the extent to 
which zero-order associations are accounted for by contemporaneous, endogenous 
variables. Let me emphasize here tha|, as always, the underlying goal of 
analysis, is. the explication of the unresolved problem which motivated the 
analysis. The analysis thus proceeds in accordance with the logic of the 
model. In the example to follbw, each of the four analytic goals will 
be discuj^sed, but the reader should remember that in substantive models 
the strategies of analysis make sense only^^in relation to the theoretical 
purpose of the model . 

Consider 4hQk following block-recursive equation model: 

^1 = Pl2-^2 Pl3 ^3 ' Pl4 ^ ^ ^15 '5 ' S ' Plu " 
^2 = P24 ^^4 ' P25 S ' P26 ^6 ' P2v ' 

^3 = P34 ^4 P35 ^5 ' P36 ^6 ' P3w « (22) ♦ - 



in which E{uv). = E(uw)\= 0, by assumption, but no assumption is required 

about the association of the residual's of and X^. The model is diagrammed 

in Figjjre 3. The. data are fictitious-, they were published in Kerlinger 

and Pedazur's (1973:331-3) text, and are used here for purposes of illustration. 

■The dependent variable* is verbal achieve^heot , and the exogenous " 

variables are race, mental ability, and school quality. Two variables are 

thought to be causally intervening between verbal achievement and the 

exogenous variable^ These are the sociop^chol ogical variables, self- 

,conc6pt and level of aspiration. While it rri^kes goolfi sense to consider 

f-concept and level of aspiration dependent^upon r^ce, IQ, and school 

quality, it is not clear to me that causality can be unambiguously 

established between the two sociopsychological variables. For my purposes, 
\ 

it IS not nec^^sary to establish the causal order, because my interest is 
\^r^mr^^y in the extent to which they account for the associations between 
/ verbal achievement and the three exogenous variables. Therefore, I 
specified no causal link between self-concept and aspiration, but do 

consi dec the possibi 1 i ty.of correlated errors. Of course, if the causal 

V ■ ■ ■ 

association lls^tween these variables was of theoretical interest, the modet^ 
would require "reformulation . 

To illustrate the first s.trategy of analysis for block-recursive 
models> consider the association between IQ and verbal achievement. The 
fundamental theorem re^als: ^ 

^5 = Pl2 '25 ' Pl3 '35 ^-^14 ^45 ' h 5 '55'' P16 ^56 ^^S). 
and ^ substitution for r^^ and r^^, the reduced-form equation is: 

^'15 = Pl5 ^ P12 P25 ' Pl3 P35 ' Pl4 '45 ' P16 56 

' P12 (P24 '45 ' P26 '56^ ' ^13 (P34 '45 ^ P36 '56^ (24). 
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Figure 3. Block-Recursive Equation Model of Verbal Achievement; 
(Source: Kerlinger and Pedhazur, 19:^3! 331) 



• 23 
In nimierlc terms, 

""iS.^ '^^ ' -51 + .(?! + .03 + .05 (25). 

Thus, 85 ^Percent of the association between IQ and verbal achievement 
was. a direct effect, and. only seven percent may be said to have occurred 
through the tw^ in^erveninq variables, self-concept and level of as.pi^tion 

, The Second strategy of analysis is tp determine the extent to which 
antecedent variables account for relationships between other variables. ' 
^Consider the association between sel f-concept and verbal achievement. The 
decoruposition of the correlation is: 

'12 Pl2 ^22 ' Pl3 ^23 ' P\l4 ^24 ' ^15 ^25 ^ Pig ^26 ^^G). 
The correlation r^^ = 1 vp by definition, but each of the remaining 
corre latiofis niay themselves be decomposed by means of the fundamental 
theorem. Decompositions of the three corefelations , r_-, r_p, and r„^, 
are strai gf^t forward (the interested readers may insert these intermediate ^ 
decompositions). The' correlation of and my be written: 

^23 = P24 ^"34 ^ P25 '3S \^26 ^36 ^ ^2^ 'v3 (27), 

'23 = P24 ^34 ' P25 ^25 ' ^26 ^6 ' ^2A ^^35 ^5,^ ^36 ^4^ 
' '^25 (^34 Us ' ^36 ^6^ ' "26 ^^34 ^46 " ^35 ^6^ 
♦P2vP3w'vw (28). 
Therefore, the reduced- form equation for .the correlation of X^ and is: 

"12 ^ t>12 ' f'l3 ^24 P34 ' Pi 3 ^25 ^35 ' ^13 ^26 P36 ' Pl4 P24 
• ' Pl5 P25 • Pl6:P26 ' Pl3 "24 ^P35 ^45 ' P36 ^46^ 

' "13 P25 ("34 ^4e, ' "36 UC? ' "13 "26 ^^34 ^46 ' "35 ^56^ 

' "14 (P« '^45 ' "26 ^46^ ' "iS ^"24 ^45 ^P26 ^6^ 

' "16 ("24 ^46 " "25 ^56^ "13 "2v P3W '"vw ^^^^ ' 



The direct effect is given by ^ (which is 37 percent of -^^J- 
Th<} next six components on the right-hand side of equation 29 measure the 
extent to which the antecedent variables X^, X^, and Xg account!^ for the 
relationship between X^ and X2. Th'us, a portion of the relationship 
between X^ and is spuriously due to three exogenous variables as they 
directly affect self-concept and verbal achievement, and indirectly 
affect verbal achievement through level of aspiration (X^). In numeric , 
terms, the sum of these six components is .08, or 26 percent of the total, 
which answers the question addressed in this second st^^t^^ of analysis. 

The third strategy of analysis for block-recursive models is to 
examine the residuals for correlated errors. In specifying the model, 
no assumption was made about the correlation of residuals for X^ and 
X^, It is now appropriate to compare the correlation of residuals to 
the zero-order correlation; we do so to' determine the extent to which 
race,'IQ, and school quality determine the association of self-concept 
and level of aspiration. The. reduced-form equation for^ the zero-order 
Correlation of X^ and X^ has already been shown in equation 28. The 
zerp-order coefficient was .40. and equation 28^ suggests that race, IQ, 
and school quality directly and jointly account for 35 percent (.14) 
of the zero-order association, and the correlated errors account for the 
remaining 65 percent (p2^ Pj^ = -27). In other words, most of the 
association between these two sociopsychological variables is caused 
by variables not explicitly included in the model. ; * . 

The final strategy of analysis is to determine the extent to which^ 
zero-order associations are accounted for by contemporanepus endogenous 
variables. For example, focus on self-concept (X2), and on the zero-order 
relationship between self-concept (X2) and verbal achievement (X^); 

\ 



how much of this relationship Is. accounted for by the contemporaneous 
var^le, level of aspiration (X3,)? That Is, once we aamit that 
and have correlated errors, we must. also admit the possibility that 
the zero-ordiBr correlation of X, and X^ includes an effect attributable 
to the correlated errors of and X^. This effect is shown in equation 
29 as the product, p^^ p^ r^, which' equals the value, .045, or 
15 percent ^^^^ " -30. 

NONRECURSIVE MODELS 

The models considered aboj^e were causally unidirectional. However, 
there are t]n\$s when a researcher's formulation of a model dictates 
that one variable thought to be a cause of another, is also thought, 
to be caused by it., either directly or indirectly. In other words, an 
independent variable in one equation is, in another equation, regressed 
on the dependent variable in the first equation. In such situations, 
estimates are no longer obtainable through ordinary least squares procedu 

In the area of education, nonrecursive, reciprocal effects were 
hypothesized by Duncan, Haller, and Portes (1968) between respondent's 
plans to attend college and the plans of respondent's best friend (see 
Hout and Morgan, 1975). For purposes of i 1 lustration i however, I would 
prefer to consider the simplest possible nonrecursive nx)del of four 
variables. However, substantive examples of such models are ra'r^. 
Erlanger and Winsborough (1976) used such a model to explain the sub- 
culture of violence thesis (Wolfgang, 1958). But rather than repeat 
their already cogent example, I prefer to construct another.- It is 
outside the area of education, and somewhat contrived, yet I hope 
instructive. 



ti potentially viable political party (Wolfle, 1976), There was strong 

support for tlhe party among miners, one-crop farmers, lumbermen, some 

immigrant groups, and others. The party's presidential ticket was 

♦ 

headed by Eugene V,. Debs, an attractive vote-getter for the Socialists, 
Yet one Is eventually led to ask whether the party's electoral fortunes 
were due to the popular presidential candidate, dr whether <the presidential, 
candidate merely gained support from voters attracted to the party's 
basic philosophy. ^ 

,To effect this analysis^ consider the model diagrammed in Figure 
4(a)i The four variables^ are the Illinois presidential SPA vote in 
1908 and 1912, and the Illinois SPA vote for the State Treasurer in,the 
same elections. The latter are taken as proxies for structural support 
for the SPA; the SPA State Treasurer candidate was seldom a factor in 
the vote cast for the party. The model indicates that the presidential 
and state treasurer vote in 1908 are correlated for unknown reasons. 
The curved line in the diagram has no arrowheads to caution us that the 
decompositions employed in previous tnodels do not apply to nonrecursive 
models. The presidential vote in 1912 is hypothesized to depend on the 
Candidate's vote-gathering ability in the last election, and the party's 
structural support in both the current Election and the last. At the 
same time, the party's structural support is hypothesized to depend upon 
the party's basic support four years earlier, and also the presidential 
vote in both 1908 and 1912. The equations implied by the model are: 

= Pl2 ^2 Pl3 ^3 Pl4 ^4 Plu " 

^^2 ' P2T P23 ^3 P24 ^4 ' 



-ifrthotft^jetiboHng'^the-'potirtt't 

Oversimplifying a bit, there. are only six intercorrelations supplied 
by the data, yet the equations have eight unknowns. The model is said 
to be unidentified (see Duncan, 1975). 

In order to obtain estimates for the model, some prior restraints . 
must be placed on it. This is accomplished by constraining some of the 

n[jaths^ to^ be zerov Wh i ch ones depencLupon Ihe substantive questions ^being 
asked of the data. If the researcher, is primarily interested in the 
cross-lagged effects, then the reciprocal effects are specified zero, 
and the model may be analyzed 'as a set of block equations. If the 
researcher is primarily interested in the reciprocal effects, then the 
cross-lagged effects are specified zero, such a^ shown in Figure 4(b). 
Thus, the strategy of analysis for nonirecursi ve models depends upon 
the substantive questions that motivated the analysis. The researcher 
must choose between models by going beyond the statistical ynformation 
available. If the reciprocal effects are of theoretical interest, 
then the^ strategy of analysis is to cortipare the size of the reciprocal 
effects to each other, and to other independent variables. 
The model to be estimated is: 

==Pi2^2^Pl3^3^Plu" , 

X2=P2i ^i^P24^4^P2v "; ' 
To estimate the parameters of this model ordinary least squares procedures 
are not appropriate. A one-step method has been known for some time 
(Goldberger, 1964), but most social science computer software packages 
do not include these two-stage least squares procedures. However, with 
certain precautions ordinary least squares regression may be employed in 
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two stages. I estimated theToefftcients for thi s mod el by us i n g 

SPSS (Nie, et aK, 1975), and adjusting the second stage output for 

Inherent computational errors (Hout, 1977). 

The results are shown in Figure 4(b). I am led to conclude that. 

the presidential vote in 1908 was relatively more important.^than the , 

philosophy-based support in 1912 (;58 versus .42) for the presidential 

vote in 1912, 'For the state treasurer vote in 1912, the philosophyr 

based support in 1908 was relatively more important than the presidential 

vote in 1912 (.63 versus .37). In answer to the question, which was 

the more important source or electoral support in 1912, the presidential 

candidate's popularity of the party's basic philosophy, the answer seems 

to be that these effects were about equal. There did, indeed, seem to 

be reciprocal effects-^the party received votes onythe coattails of 

Eugene Debs, and Debs received votes from people attracted to the party, 

not necessarily its candidates. 

CONCLUSION 

Duncan reminded us that "the study of structural equation models 
can' be divided into two parts: the easy part and the hard part" (1975: 
149). This paper has been about an easy part-^how a researcher can 
manipulate algebric equations to assist in the interpretation of an 
existing model. The hard part is constructing models that are consistent 
with sound theory. To incorporate sound- theoretical ideas in structural 
equation models requires a measure of creativity that goes far beyond 
the bounds of this paper. As Duncan beforeme suggested, if I knew how 
to construct creative, sound models, I wouhM have already done so. 

The models that I have presented in this paper to exemplify strategi 
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of analysis are therefore no better than the ideas that wen^t Into them.. 
If the models were based on unsound ideas, the models are unsound. Yet 
I find path models extremely- useful , because they force me to formulate 
my Ideas In Explicit form, and allow me to read the idpas of others in 
explicit form. I see this as the most important advantage of path 
analysis. 

Readers who would like to gain further insight into the construction 
and analysis of models may begin with the collections of Blalock (1971), 
Goldberger and Duncan (1973), and Sewell, Hauser and Featherman (1976). 
The American Sociological Review often includes articles that incorporate 
in them structural equation models, and readers may find these instructive 
Yet ideas and models are often flawed, and the comment section of the 

V 

journal contains very instructive dialogs between authors and critics. 
I recommend these discussions to you, and you might begin with, for 
example. Havens and Tully (1972) , Halaby (1973) , Featherman (1973), 
Hannan, Freeman and Heyers (1976) , Alexander and Griffin (1976), or 
Bohrnstedt (1977). 

. ■ . . - X 
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FOOTNOTES 



For those who are unfamiliar with path analysis there are several 
good introductions* • Land's (1969) article is good, but like many early, 
papers on the subject glosses over the distinction between sample estimates 
aiid population parameters. Moreover, Land (as well as Kerlinger and 
Pedhazur [1973:316]) il 1-advisedly replicated Duncan's (1966) defective 
conception of measuring indirect effects. A more proper consideration 
of indirect effects may be found in Duncan (1971) or Finney (1972). « 
Another introduction to path analysis may , be found in Wolfle (1977), 
and Duncan's (1975)' textbook is the best in the field. 

2 . 

A complete description of these data may be found in Wolfle (forth- 
coming). 

3 ^ ' 

In fact it is the population correlation coefficient, py.r, which 

is assumed equal to zero. When dealing wi^th sample^data the assumed^ 
equality will not hold exactly. Setting r^^ and r^g to zerb is therefore 
a practical determination to be distinguished from the formaV tenets of 
statis.tics. 

The complete equation would be: ' 

, ' ^2u = P2v %u P24 '^4u P25 '^Su ^ P26 '^u ^ ^21 ""lu ' 
Nt' ^4^^ " ' ^5^ " ^7^ = 0 by assumption, i.e., E(X:jU) = 0,* and the 

•full equation reduces to the form expressed in t^e textl- 
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